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Abstract 



' The coexistence between different types of templates has been the choice solution to the information crisis of prebiotic evolution, 
p-J triggered by the finding that a single RNA-like template cannot carry enough information to code for any useful replicase. In 
p, principle, confining d distinct templates of length L in a package or protocell, whose survival depends on the coexistence of the 
. templates it holds in, could resolve this crisis provided that d is made sufficiently large. Here we review the prototypical package 
O model of Niesert et al. (1981) which guarantees the greatest possible region of viability of the protocell population, and show that 
^ this model, and hence the entire package approach, does not resolve the information crisis. This is so because to secure survival 
I ^the total information content of the protocell, Ld, must tend to a constant value that depends only on the spontaneous error rate 
^ per nucleotide of the template replication mechanism. As a result, an increase of d must be followed by a decrease of L to ensure 
the protocell viability, so that the net information gain is null. 
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1. Introduction 

The information crisis in prebiotic or chemical evolution, 
as envisaged by Eigen (1971) in the 1970s, is grounded in 
two key observations: (i) the length of a rephcating polymer 
(i.e., RNA-like template) is limited by the fidelity of replica- 
tion (Eigen, 1971), and (ii) templates that differ from each 
other such that their differences reflect on their replication 
fates cannot coexist in a purely competitive setup (Swetina 
and Schuster, 1982). Despite some criticisms about the bi- 
ological relevance of the information crisis (see, e.g., Wiehe 
(1997); Wilke (2005)), the seminal work of Eigen has raised 
true questions about the onset of life, fueling the still un- 
resolved debate on the mechanisms that explain how the 
primeval replicators coped with the loss of information re- 
sulting from competitive exclusion and recurrent muta- 
tions. 

There are two competing scenarios that seek to solve the 
prebiotic information crisis by achieving template coexis- 
tence. In principle, template coexistence solves the infor- 
mation crisis because the total information content of the 
template pool is the product of the number of different 
templates and the maximum information coded per tem- 
plate, provided the template types have the same concen- 
tration. The first scenario is the hypercycle, a closed re- 
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action scheme in which each replicating polymer aids in 
the replication of the next one, resembling a current motif 
in biochemical metabolic pathways (Eigen and Schuster, 
1978). This scheme requires that the primordial replicators 
act both as templates and as cross-catalytic replicases, but 
this key assumption encounters strong criticism. In fact, 
since natural selection is responsive to replication rates 
only, one can expect it to make each element of the hyper- 
cycle a better target for the replicase (or a better replicase 
to itself), but one cannot expect natural selection to drive 
the cross-catalytic activity of the hypercycle components 
in the same way. Therefore, the likely fate of each replicase 
is to mutate into the so-called parasites - molecules simi- 
lar to the modern genomic polymers which lack catalytic 
activity (Smith, 1979; Bresch et al., 1980). In addition, hy- 
percycles with a large number of different templates are 
subject to large fluctuations in the concentrations of its 
members, rendering it susceptible to extinction by stochas- 
tic effects (Nuiio and Tarazona, 1994; Campos et al., 2000). 
Nonetheless, the hypercycle model has received substantial 
empirical support regarding its basic premises as catalytic 
activity in a variety of biological molecules, including DNA 
and RNA, and the experimental synthesis of minimal self- 
replicating systems are now well established (see Paul and 
Joyce (2004)). 

The alternative prebiotic scenario to address the tem- 
plate coexistence issue is the so-called package model 



approach, in which the templates are confined in pack- 
ages termed protocells or prebiotic vesicles (Gabriel, 1960; 
Bresch et al., 1980; Niesert et al, 1981; Niesert, 1987; Sil- 
vestre and Fontanari, 2005; Fontanari et al., 2006). The 
basic assumption here is that a nonspecific replicase is cod- 
ified and assembled by a number d of distinct functional 
templates. Template replication is possible only with the 
aid of the replicase which, in turn, can be put together 
only when all template types are present in the package. 

The combined dynamics of templates and protocells is 
in many aspects similar to the mctapopulation dynamics 
used in the study of patches in ecology (see, e.g., Keymer 
et al. (2000)). The distinguishing features of the package 
models arc the random drift in the template dynamics due 
to the finite capacity of the protocells, and the assortment 
load resulting from the stochastic fission of the protocells. 
These very features that make the dynamics interesting, 
also make it much less amenable to analytic treatment. 
In fact, as compared with the hypercycle and quasispecies 
models for which the deterministic chemical kinetics equa- 
tions suffice to describe most of the relevant phenomena 
(see, however, Alves and Fontanari (1998); Campos and 
Fontanari (1999)), package models like the classical model 
of Niesert et al. (1981) or the stochastic corrector model 
of Szathmary and Demeter (1987) are not so well char- 
acterized and only recently slightly simplified versions of 
the original models were analyzed thoroughly (Grey et al., 
1995; Zintzaras et al. , 2002; Silvestre and Fontanari, 2005; 
Fontanari et al., 2006; Silvestre and Fontanari, 2007). 

Our main reservation about the package models premises 
concerns its group selectionist flavor. One way or the other, 
all aforementioned package models assume that the growth 
rate of the protocell lineage depends on the composition 
of the template population confined within each protocell. 
Although at first sight this may seem a reasonable assump- 
tion, it is not supported by experimental evidence: recent 
experiments with model protocells show that the growth 
of lipid vesicles does not depend on the nature of the sub- 
stances they hold in (Hanczyc et al., 2003; Chen et al., 2004, 
2006; Zaher and Unrau, 2007). A more plausible assump- 
tion is to admit that protocell fission is triggered when the 
total concentration of the templates reaches some critical 
value so that the integrity of the membrane is compromised 
(Chen et al., 2006). 

Interestingly, the package model proposed in the pioneer 
work of Niesert et al. (1981) conforms to this more realistic 
scenario, as no connection is made between template com- 
position and the mechanism of protocell fission. Alas, for 
purely technical reasons - to keep the protocell lineage to 
a size that could be manageable by their computational re- 
sources - those authors have introduced an extraneous in- 
gredient into their model, namely, a prospective value which 
essentially gauges the survival probability of a protocell ac- 
cording to its template composition. This prospective value 
plays exactly the same role as the group selection pressure 
in models where a direct relation between template com- 
position and protocell fission (or protocell reproduction) is 



made explicit (Zintzaras et al. , 2002; Silvestre and Fonta- 
nari, 2005; Fontanari et al., 2006). To handle and quantify 
the unlimited growth of the protocell population we borrow 
a powerful tool from statistical physics, namely, the spread- 
ing or epidemic analysis used to characterize nonequilib- 
rium phase transitions in lattice models (Grassberger and 
de La Torre , 1979). This technique suits particularly well 
to the task of locating the boundaries in the space of pa- 
rameters that separate the subcritical regime, where the 
extinction of the lineage is certain, from the supercritical 
regime, where the probability of survival of the lineage is 
nonzero (see, e.g., Rosas and Fontanari (2003)). 

Since ingredients such as prospective values for protocell 
survival or protocell fitness commonly used in the defini- 
tion of package models are not supported by experimental 
evidence, we find the reexamination of the original version 
of the model of Niesert et al. (1981) absolutely essential. 
More importantly, because viable protocells (i.e., protocells 
that contain the d functional template types) are allowed 
to reproduce unrestrainedly, the region of viability of the 
mctapopulation is the greatest possible in this model. Any 
other package model that assumes protocell competition 
by assigning fitness values to the protocells according to 
their template compositions will exhibit a smaller region of 
viability. 

We have found that the confinement of templates in pack- 
ages does not solve the information crisis of prebiotic evo- 
lution because, to guarantee the viability of the protocells, 
the total information content of each package must remain 
constant, i.e., the product between the template lengths L 
and the number of distinct templates rf is a constant that 
depends, essentially, only on the spontaneous error rate per 
nucleotide. Hence there is no information gain of increas- 
ing the number of coexisting templates types since their 
lengths must decrease proportionally. 

The rest of the paper is organized as follows. In Sect. 2 
we review the package model of Niesert et al. (1981) and 
in Sect. 3 we briefly describe the spreading analysis tech- 
nique. The results are presented in a crescendo of difficulty 
from Sects. 4 to 6. In particular. Sect. 4 is dedicated to the 
study of the limit of perfect replication fidelity; Sect. 5 con- 
siders the eflfect of mutations to the so-called parasite class, 
which are essentially nonfunctional templates; and Sect. 6 
the effect of mutations to the lethal class, which are precip- 
itating agents that prevent the assemblage of a replicase. 
Finally, Sect. 7 is devoted to our concluding remarks. 

2. The package model of Niesert et al. (1981) 

According to the model of Niesert et al. (1981) we con- 
sider a metapopulation composed of a variable number 
of packages or protocells, each of which encloses a cer- 
tain number of templates (RNA-like molecules) and a few 
polypeptides. Since all our simulations begin with a sin- 
gle mutant-free protocell we use the words metapopula- 
tion and protocell lineage interchangeably throughout this 



paper. There are d distinct types of functional templates 

which, when present in the same protoccU are capable of 
assembling a nonspecific replicase of finite processivity and 
finite fidelity of replication. As pointed out by Niesert et al. 
(1981), this kind of primitive translation is achieved by co- 
operation of RNAs with t-RNA character and an RNA with 
messenger function. Here we define the processivity A of 
the replicase as the number of template copies it produces 
in some convenient unit of time, so A can be viewed as a 
measure of replication efficiency as well. This definition can 
be made equivalent to the usual definition of processivity 
as the average number of nucleotides added by a replicase 
per association/disassociation with the template if A is al- 
lowed to take on nonintegcr values and all templates have 
the same length. We recall that the replicase can only be 
formed in the presence of all d types of template. If a pro- 
tocell lacks a single functional template type it is consid- 
ered unviable and discarded from the protocell population. 
In addition, all template types display identical targets to 
the replicase, which results in a neutral replication process, 
i.e., all template types have the same replication rate. 

Since any plausible replication process is susceptible to 
errors, we must take into account the possibility of muta- 
tions. Two types of mutations are allowed in the model. 
First, there are mutations that produce non-functional 
templates which contribute nothing to the assembling 
of the replicase but keep their replication capability un- 
harmed. These mutants are termed parasites. Second, 
there are lethal mutations which generate molecules that 
prevent the assembling of a replicase or, equivalently, pro- 
duce a non-functional replicase. In any case, an offspring 
protocell that contains a mutant of this kind is rendered 
unviable. Mutations to the parasite type occur with prob- 
ability u whereas mutations to the lethal type occur with 
probability v. Reverse mutations or mutations between 
functional template types are not considered. The life cy- 
cle of the protocells consists of three events - template 
replication, protocell fission and protocell demise - which 
take place in this order and are described in the following. 

2.1. Template replication 

This process, which describes the replication of the tem- 
plates inside the protocells, is regulated by one of the cru- 
cial parameters of the model, namely, the number of repli- 
cated molecules between two vesicle fissions (A). As already 
pointed out, this quantity is the processivity of the repli- 
case, i.e., the number of template copies the replicase can 
produce in a unit of time, taken here as the time between 
two consecutive fissions. The replication process is imple- 
mented as follows. For each protocell we choose a template 
at random and replicate it, returning both the original tem- 
plate and its copy to the protocell, and this procedure is 
repeated until the processivity of the replicase is reached, 
i.e., it is repeated exactly A times. If the randomly selected 
template is a functional template then the copy will become 



a parasite with probability zt or a lethal with probability v. 

Hence the probability that the copy is perfect is 1 — u — v. 
If the selected template is a parasite, then the copy will 
become a lethal with probability v. Finally, in the case a 
lethal is selected, then the copy will also be a lethal. As a 
result of this procedure, exactly A new templates are added 
to the protocell. We note that in the absence of mutations 
this process is exactly the Polya's urn scheme (Feller , 1968, 
Ch. V.2). The template replication procedure is repeated 
for all protoc;ells in the mctapopulation. In what follows we 
will refer to the number of templates inside a protocell as 
the size of the protocell. 

2.2. Protocell fission 

After template replication, the protocell is in excess of 
molecules since; \vv assume; that the membrane is imperme- 
able to templates. As template replication is much faster 
than protocell fission, we envisage a scenario where fission 
begins at about the same time as the template replication 
cycle but terminates only after the replicase has exhausted 
its copying capacity. This means that the overall osmotic 
pressure tolerated by the protocell membrane is of the or- 
der of A, so that the fission process is triggered whenever 
the protocell size exceeds A. For example, if the size of a 
just generated daughter protocell is n > A then the fission 
process starts but, by the time it is concluded, the protocell 
has already reached the size n + A. 

Experimental models of lipid vesicles exhibit a variety of 
mechanisms for the fission process (Hanczyc and Szostak, 
2004). Here we choose a mechanism that splits the mother 
protocell in two daughters and distributes templates of the 
mother vesicle between the two offspring at random. The 
random assortment of templates to the daughter vesicles 
may cause the loss of essential genes for survivorship (i.e., 
the assortment load), thus producing unviable protocells. 
To be more precise, if S is the size of the mother protocell 
just before fission then the probability that the daughters 
have size s and S — s is given by the binomial (^)2~'^. 

2.3. Protocell demise 

The viability of a daughter protocell is guaranteed pro- 
vided that (i) it carries no lethal mutants, and (ii) it has 
at least one copy of each type of functional template. Any 
protocell lacking one of those templates or carrying a lethal 
mutant is dismissed because it is incapable of producing a 
working replicase. 

2.4. Mctapopulation dynamics 

The sequence of the three processes stated above defines 
the time unit of the metapopulation dynamics. ^From these 
processes it is clear that the size of the metapopulation 
(i.e. the number of protocells) can, in some cases, increase 
without bounds. In fact, the metapopulation dynamics can 



be viewed as a branching process with denumerably many 
types (Kimmel and Axehod , 2002). As pointed out before, 
to circumvent this 'technical' difficulty which rapidly sat- 
urates the computer resources of the time, Niesert et al. 
(1981) have opted to discard supernumerary protocells ac- 
cording to an arbitrary prospective value which essentially 
gauges the odds of a protocell to leave viable descendents 
(sec Sect. 5). In practice, using such a prospective value to 
eliminate the least promising protocells introduces an ad- 
ditional criterion for protocell demise, which can be inter- 
preted as a selection pressure acting at the protocell level 
which favors protocells with high prospective values. Since 
the survival probability of the protocell begins to depend 
on specific details of their template compositions, the re- 
sulting model becomes very similar to the group selection 
models proposed to explain template coexistence (see, e.g., 
Fontanari et al. (2006)). Interestingly, Niesert et al. (1981) 
do not mention group selection or multi-level selection: al- 
though they state that the unit of selection is the entire 
package, they overlooked the conflict between package and 
template interests, especially the parasites. Here we do not 
use any prospective value to control the metapopulation 
size and the only criteria for elimination of a protocell is 
the presence of a lethal mutant or the lack of any type of 
functional template. 

Although the number of protocells may grow unbound- 
edly, the number of templates inside each protocell (i.e., 
the protocell size) fluctuates around a finite value, though, 
strictly, a viable protocell can assume any size value in the 
range d, . . . , oo. In fact, starting from whatever conditions 
the average number of templates per package will approach 
the processivity of the replicase very quickly. Consider, for 
example, the average number of templates rit in a protocell 
at time t. It is related to the average number of templates 
of its mother protocell rit-i by the recursion equation 

{nt-i + A) 



nt = 



whose solution is simply 



no - A 
nt = + A, 



(1) 



(2) 



where no is the size of the vesicle that originated the lin- 
eage. Hence, regardless of the ancestor size, after a few gc!n- 
erations the average size of the protocells will equal the 
processivity value of the replicase. 

The ultimate goal of the analysis of the metapopulation 
dynamics is to determine the regions in the space of the pa- 
rameters d (number of functional templates) , A (processiv- 
ity of the replicase), u (probability of mutation to parasites) 
and V (probability of mutation to the lethal type) where 
the protocell lineage has a nonzero probability of thriving. 
To achieve that we resort solely to brute force simulations 
of the metapopulation dynamics as will be described in the 
forthcoming sections. A slightly modified version of this 
model, in which template replication takes place simulta- 
neously through a Wright-Fisher process and the daughter 
protocells have fixed size A, yields to an analytical approach 



(Silvestre and Fontanari, 2007). In this contribution, how- 
ever, we have opted to stick to the original model of Niesert 
et al. (1981) so as to complement and complete that clas- 
sical work on the theory of prebiotic evolution. 

3. The spreading einalysis 

The spreading or epidemic analysis put forward by Grass- 
bcrgcr and dc La Torre (1979) is the preferred technique of 
statistical physics to locate and characterize equilibrium as 
well as non-equilibrium phase transition lines. The method 
is tailored to population dynamics problems that exhibit 
an absorbing state (subcritical regime), where the proto- 
cell population has gone extinct, and an active state (su- 
percritical regime) where the population undergoes expo- 
nential growth. Separating these two regimes there is the 
critical regime where the population growth (if any) is sub- 
exponential, usually a power law in the time variable. 

The basic idea of the spreading analysis is to follow the 
evolution of the lineage that sprouts from a single mutant- 
free protocell of size A, in which the functional templates 
are evenly represented. For each time t we carry out from 
10^ to 10® independent runs, all starting with the same an- 
cestor protocell. The number of runs depends on the values 
of the control parameters, and is chosen so as to guaran- 
tee a representative number of surviving samples at any 
given time. Here wc focus on the time dependence of two 
key quantities: the average number of protocells Nt and the 
survival probability of the lineage Pt calculated at time t. 
Clearly, Pt is simply the fraction of runs for which there is 
at least one protocell in the lineage at time t. We note that 
in the calculation of Nt we take an average over all runs, in- 
cluding those for wich the lineage has already gone extinct 
at time t. 

Let A be a real variable that measures the distance to 
the critical point. For example, if the transition from the 
absorbing to the active phase takes place at the mutation 
probability u = Uc when all other parameters are held fixed 
then A = w,,, — u. Close to this transition point, i.e. , for A « 
0, and for sufficiently large t, we expect that the average 
number of protocells and the survival probability obey the 
scaling hypothesis 



(3) 
(4) 



where r], 5 and u are non- negative critical dynamic expo- 
nents, and (ppf and (f)p are universal scaling functions, usu- 
ally exponential functions (Grassberger and de La Torre 
, 1979). At the critical point A = 0, a log- log plot of Nt 
(or Pt) as function of t yields a straight line, the slope of 
which yields the dynamic exponent rj (or S). Upward and 
downward deviations from the critical straight line indicate 
supercritical and subcritical behaviors, respectively. The 
mere observation of these deviations allows a very precise 
estimate of the value the critical parameter Uc- 
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A particularly important quantity, as far as the charac- 
terization of branching processes is concerned, is the ulti- 
mate probability of survival of the lineage, defined as 

n = lim Pt (5) 

t — >oo 

so that n = in the critical and subcritical regimes (u > 
Uc), whereas 11 > in the supercritical regime {u < Uc)- 
In the latter case, for u sufficiently close to Uc we have 
another scaling relation, 11 ~ A'^ where /3 is a static critical 
exponent. Since Eq. (4) holds for f — > oo in the sub as well 
as in the supercritical regime the critical exponents (5, v and 
(3 must not be all independent. To see that we rewrite Eq. 
(4) in the supercritical regime as Pt ~ H^'^^if^p (A'^i) where 
tpp{x) = x~^<j)p{x) is an arbitrary function that tends to 
a finite positive value when x ^ oo. It follows then that 
/3 = Sv. The precise estimate of these exponents is a major 
concern of a branch of the statistical physical that deals 
with branching-like processes in lattice models (see, e.g., 
Marro and Dickman (1999)). The branching process we 
consider here can be viewed as a mean-field-like version 
of similar processes in lattice models and so the critical 
exponents are the mean-field ones: t] ~ 0, S — —1, and 
u = (3 — 1. The a priori knowledge of these values can be 
useful to validate our estimate of the threshold locations, 
which are actually our main concern in this paper. 

4. The cost of diversity 

The information crisis of prebiotic evolution revealed by 
the quasispecies model is essentially a consequence of the 
limited fidelity of replication of the templates with the re- 
sulting steady accumulation of mutants in the population 
(the mutational load in the population genetics jargon). 
In package models there is an additional threshold phe- 
nomenon which, even in the absence of mutations, limits 
the amount of information stored in the molecular pool: 
the limited processivity of the replicase bounds the num- 
ber of distinct template types that can coexist in a pro- 
tocell. We refer to this new information crisis as the cost 
of diversity. Deterministic models, such as the quasispecies 
and the hypercycle models, assume that the processivity of 
the replicase is essentially infinite but a recent estimate of 
this parameter in artificially selected RNA polymerase ri- 
bozymes yields a processivity on the order of a few tens of 
nucleotides (Zahcr and Unrau, 2007), which is insufficient 
to produce even the shortest functional ribozyme known, 
the hairpin ribozyme (Kun et al., 2005). To better char- 
acterize the processivity threshold in this section we focus 
in the case in which the replication fidelity is perfect, i.e., 
u = V = 0. 

Although some features of the primordial replicase such 
as its specificity and processivity are key elements in most 
prebiotic scenarios, their roles are rarely punctuated in the 
literature. Notable exceptions are Michod (1983) who has 
argued that the lack of specificity of the primordial repli- 
case gives the templates an altruistic-like character which 
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Fig. 1. Average number of protocells in a lineage originated in a 
single well-balanced and mutation-free founder. The parameters are 
d = 4 and (from bottom to top) A = 4, . . . , 10. The data shown 
for each t represents the average over 10^ independent runs. The 

semi-logarithmic scale facilitates the observation of the exponential 
increase and decrease of the lineage size in the sub and supercritical 
regimes. 




Fig. 2. Same as Fig. 1 but for the survival probability. For large t this 
quantity tends to a constant value, the ultimate survival probability 
n. The minimum value of the processivity A that guarantees n > 

is Ac = 9 in this case. 

could be maintained by the confinement of the templates 
in packages, and Niesert et al. (1981) who have, in turn, 
shown that a high processivity is necessary to make up for 
the assortment load and so to guarantee the coexistence of 
a few functional templates in the protocell. 

To obtain the minimum value of the processivity A of a 
replicase that needs d functional templates for its assem- 
bling we use the spreading technique as illustrated in Figs. 
1 and 2 for the case (1 = 4. Given the discrete nature of A, 
the critical regime is absent in this case and so one observes 
a discontinuous jump from the typical subcritical behavior 
(both Nt and Pt vanish exponentially with increasing t) to 
the supercritical regime (Nt increases exponentially with 
increasing t whereas Pt approaches the ultimate survival 
probability 11). We recall that the calculation of Nt includes 
the extinct runs as well and this is the reason that Nt can 
take on values smaller than 1 in the subcritical regime (note 
that A^o = !)• 

Repetition of the spreading analysis for different values 
of the diversity d allows us to obtain the dependence of the 
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Fig. 3. Logarithmic plot of the minimum value of the processivity Ac 
above which the protocell Uneage has a finite probability of escaping 
extinction against the diversity d. The extinction of the lineage is 
certain for A < Ac. The solid line is the fitting Ac = cP/2. The 
parameters axe u = v = 0. 

minimum processivity Ac with d. The rcsuhs are summa- 
rized in Fig. 3, which essentially shows that for d not too 
small the data is described very well by the fitting Ac = 
For A < Ac, the replicase is unable to produce suffi- 
cient copies of the templates to compensate for the stochas- 
tic loss of functional template due to the assortment load, 
and so the lineage goes extinct with probability one. For 
A > Ac, the lineage has a nonzero probability of ultimate 
survival. 

As pointed out by Silvestrc and Fontanari (2007), there 
is a simple combinatorial argument to explain the scaling 
Ac ~ at the critical boundary in the error- free replication 
case, that goes as follows. Assuming that the typical size 
of the protocells is A then the number of different types of 
viable protocells is 



whereas the total number of protocell types is 

The logarithm of the ratio r between these two quantities 
can be written as 

d—l . / . ^ d—1 ,9 

lnr=> In- ^~-t7 r> (8 

i=l ' i—1 

from where we can see that the only way to obtain non- 
trivial values of this ratio (i.e., r 7^ 0, 1) for large A and d 
is to assume that d^ / K remains of order of 1 , which yields 
r ^ exp (— d^/A) . Clearly, r corresponds the subcrit- 
ical regime since the fraction of viable protocell types is 
vanishingly small in this case, whereas r 1 corresponds 
to the supercritical regime as practically all protocell types 
are viable. 

The relation Ac ~ d^/2 is the same as that found in the 
original work of Niesert et al. (1981) as well as in the vari- 
ant of Silvestre and Fontanari (2007), and it poses another 
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Fig. 4. Asymptotic growth rate of the lineage as function of the 
replicase processivity in the error-free case for (from top to bottom) 
d = 4, . . . , 10. The solid lines are the fittings ^ = ln2 -|- aA^ with 
6 < 0. The value of A above which ^ becomes positive defines the 
limits to the realm of viability of the lineage. The horizontal lines 
are ^ = and J = In 2. 

challenge to the primordial protocells, for which no mech- 
anism of segregation can be assumed to exist: the need to 
overcome the efficiency lower bound Ac in order to main- 
tain a given number of distinct functional templates. This 
threat to prebiotic evolution was aptly termed Charybdis 
by Niesert et al. (1981) in reference to the monster of the 
Greek mythology. 

There is yet another difficulty that passed unnoticed in 
previous works and that makes the diversity cost transpar- 
ent. From Fig. 1 we can easily see that Nt ^ exp (^i) for 
large t where ^ = ^ (A, d) is the asymptotic growth rate of 
the metapopulation, shown in Fig. 4. The point here is that 
for A fixed, the growth rate decreases with increasing diver- 
sity, which means that there will be a selective pressure to 
increase A while d is kept to some minimum possible value. 
A way out of this conundrum would be to assume that the 
processivity of the replicase depends somehow on the in- 
formation content of the template pool (i.e., A = A (d)) so 
that it would be impossible for a system of d template types 
to assemble a replicase with processivity A Ac ~ d^ /2. 

5. The parasite load 

To better realize the implications of an imperfect repli- 
cation mechanism, we will focus first on the mutations to 
the parasite type only, i.e., m > but v = {]. The accu- 
mulation of parasites in a protocell does not make it un- 
viable, but in the long run it can reduce the efficiency of 
the replicase, leading to the underproduction of functional 
template copies that could counterbalance the disruptive 
effect of the assortment load. (We recall that the parasites 
have the same replication rate as the functional templates). 

Since the mutation probability u varies continuously in 
the range [0, 1], in this section we can finally appreciate the 
effectiveness of the spreading analysis to locate the critical 
mutation probability Uc that separates the sub and super- 
critical regimes. Accordingly, in Figs. 5 and 6 we show log- 




Fig. 5. Logarithmic plot of the average number of protocells in a 
Uneage for several values of the parasite mutation probability (top 
to bottom) u = 0.180,0.181, . . . ,0.190. The critical mutation prob- 
ability is Uc s» 0.185 and corresponds to the horizontal straight line 
{rj = 0). The parameters are d = 2, A = 2, and v = 0. 
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Fig. 6. Same as Fig. 5 but for the survival probability. The dashed 
straight line is the fitting Pt ^ of the critical curve. 

log plots of Nt and Pt for values of u close to Uc- Visual in- 
spection of Fig. 5 leads to the estimate Uc = 0.185 ± 0.001 
for case = A = 2. As pointed out before, in the subcrit- 
ical regime one has Nt ~ exp {^t) with ^ ^ — | A j'^. The 
time decay constant ^ is calculated by replotting Fig. 6 in 
the semi-log scale (see Fig. 2 for a similar graph) and then 
fitting the straight lines that result from this scale transfor- 
mation. Finally, Fig. 7 confirms the linear dependence of ^ 
on I A I, and so the exponent value v = I. We conclude then 
that the ultimate survival probability vanishes as 11 A 
when the critical mutation probability is approached from 
the supercritical regime. These results assure us of the re- 
liability and adequacy of the spreading technique to locate 
the mutation threshold in the package model of Niesert 
et al. (1981). This technical procedure was repeated for dif- 
ferent values of the control parameters d and A in order 
to obtain a complete phase diagram of the model. Finally, 
we note that in this scheme we have imposed no limitation 
whatsoever to the total protocell population size. 

Fig. 8 summarizes the dependence of the critical muta- 
tion probability Uc on d and A. We recall that for u > Uc 
the lineage is unviable, i.e., 11 = 0. For low diversity, the 
introduction of parasites has practically no effect on the vi- 




Fig. 7. Logarithmic plot of the absolute value of the time decay 
constant ^ against | A | for the data shown Fig. 6. The slope of the 
straight line used to fit the data is = 1.01 ± .01. 
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Fig. 8. Critical mutation probability to the parasite class Uc as func- 
tion of replicase processivity A for values of the template diversity d 
indicated in the figure. Mutations to the lethal class are no permit- 
ted (v = 0). 

ability of the lineage. Solely when the mutation probability 
takes very large values then the harm caused by the para- 
sites becomes appreciable and must be compensated by the 
increase of the replicase processivity A. For high diversity, 
however, the fate of the lineage is much more sensitive to the 
presence of parasites, meaning that metapopulations with 
a high diversity of functional templates must have a very 
efficient replicase right at the beginning, which happens to 
be the situation of modern organisms. All polymerase cores 
involved in replication have similar processivity values, re- 
gardless of the number of genes per genome, which can vary 
from few hundreds genes in the smallest prokaryotes to tens 
of thousands in the largest eukaryotic genomes (Benkovic 
et al., 2001). We note, however, that the more sundry viral 
polymerases do not follow this scenario. 

A salient feature of the metapopulation dynamics re- 
vealed by Fig. 8 is that, for fixed d, there is a value of the 
mutation probability beyond which coexistence is impossi- 
ble regardless of the value of the replicase processivity. This 
is so because Uc tends to the well-defined value 1 /d in the 
limit A ^ oo. Nevertheless, contrary to the claim of Niesert 
et al. (1981), increasing A is always benefic to the lineage: 
there are no 'mutational reefs' or Charybdis' partner - the 
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Fig. 9. Same as Fig. 8 but using the prospective value of Niesert 
et al. (1981) to discard supernumerary protocells. This additional 
ingredient leads to the incorrect prediction that high processivity 
values can be deleterious to the protocell population, in the sense of 
curtailing its region of viability. 

monster Scylla - awaiting for the metapopulation. In fact, 

as pointed out before, their conehision was a resiih of the 
prospective value used to Umit the metapopulation size, 
which was based on three properties, namely, the degree of 
eqiiipartition of the copies among the different functional 
templates, the number of parasites and the overall redun- 
dancy of the functional templates. This claim is confirmed 
by the result of the simulation of the metapopulation dy- 
namics using the prospective value of Niesert et al. (1981) 
as shown in Fig. 9, from where we see that Uc increases to- 
wards a maximum and then decreases towards zero as A 
increases further. Rather surprisingly, the introduction of 
an explicit group selective pressure towards more balanced 
protocells turns out to be harmful for the metapopulation, 
which now can stand much lower values of the mutation 
probability. The reason is that for large A and not too low u, 
the surviving vesicles in the supercritical regime are heav- 
ily loaded with parasites and so use of such selection cri- 
terion would purge them from the population resulting in 
the premature extinction of the lineage. 

6. The role of lethal mutants 

Lethal mutants are precipitating agents, the action of 
which disrupts the ability of the set of functional templates 
to assemble the replicase. Accordingly and following Niesert 
et al. (1981) we assume that the activity of the lethal mu- 
tants comes about only after the fission of the mother pro- 
tocell, so that only the daughter protocells are affected by 
this type of mutant. This is a best case scenario, since there 
is a chance that one of the daughters comes out free of lethal 
mutants. 

Since the mutation probability to the lethal class w is a 
real variable, we can use the spreading technique to locate 
the critical value Vc that separates the sub and the super- 
critical regimes. The main graph of Fig. 10 summarizes our 
results in the case that mutations to the parasite class are 
forbidden (u = 0). In this case it is clear that too high pro- 



Fig. 10. Critical mutation probability to the lethal class as function 
of the replicase processivity in the absence of parasites (u = 0) for 
the values of the template diversity d indicated in the figure. For 
large A we find Vc 1/A (dashed straight line) regardless of the 
value of d. The inset show Vc vs. A for d = 2 and (top to bottom) 
u = 0,0.2 and 0.4, indicating that the asymptotic scaling Vc ~ 1/A 
is independent of u as well. 
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Fig. 11. Critical mutation probability to the parasite class as function 
of the replicase processivity for d = 2 and v as indicated in the figure. 
The upper bound on A is due solely to the effect of lethal mutants, 
whereas the lower limit is due to the assortment load. 

cessivity values are harmful to the metapopulation. A com- 
promise between the 'fluctuation abyss' (loss of functional 
templates due to the assortment load) and the 'mutational 
reefs' (production of lethal mutants) is achieved by a fi- 
nite value of A, which corresponds to the maximum in the 
curves of Fig. 10. It is interesting that Vc becomes practi- 
cally independent of d in the limit of large A, where the 
data is very well fitted by the function Vc ~ 1/A, which es- 
sentially means that the average number of lethal mutants 
must be smaller than 1 in any viable lineage. In fact, the 
relation WcA w 1 holds in the case mutations to the parasite 
class are allowed as well, as shown in the inset graph of Fig. 
10. The only effect of w > is to shift uniformly Vc towards 
lower values. The important point illustrated by Fig. 10 is 
that for fixed v the metapopulation is viable only between 
a certain range of processivity values, which excludes too 
small as well as too high values of A. This result is better 
illustrated in Fig. 11 which shows the dependence of Uc on 
A for V and d fixed. 

We note that the presence of parasites does not add any 
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qualitative new feature to the model dynamics: the thresh- 
old phenomenon for low A is due to the assortment load 
only, whereas the threshold for high A results solely from 
the destructive activity of the lethal mutants. This is in 
stark contrast with the major role played by u in free- 
gene models, as evidenced by the error-threshold in the de- 
terministic quasispecies model (Eigen, 1971) and MuUer's 
ratchet in finite population models (Haigh , 1978). In fact, 
the analogue of the error threshold transition is the lim- 
iting value Uc ^ 1/d for large A (see Fig. 8) which is ir- 
relevant here because the presence of lethal mutants pre- 
vents the evolution of rcplicases with such high processiv- 
ity values. Nonetheless, the existence of an error threshold- 
like transition between viable and unviable states with re- 
spect to lethality contrasts with the absence of thresholds 
in the presence of lethal mutants in the quasispecies model 
(Wilke, 2005) and corroborates the findings of Takeuchi 
and Hogeweg (2007). The reason the steadily accumulation 
of parasite mutants does not lead to a phenomenon simi- 
lar to the MuUer's ratchet is that we have assumed a trim- 
cated fitness landscape at the protocell level, so protocell 
containing only parasites cannot accumulate. In any event, 
since the metapopulation can undergo unrestrained growth 
in the supercritical regime, the accumulation of parasites 
is not a serious hindrance as it is in the case the metapop- 
ulation size is fixed (see, e.g., Fontanari et al. (2003)). 

7. Conclusion 

The reexamination of the classic package model of 
Niesert et al. (1981) has revealed a few novel features 
and clarified some aspects of the metapopulation dy- 
namics, which were obscured in the original work by the 
introduction of an artificial prospective value to limit the 
metapopulation size. To circumvent this difficulty, we have 
resorted to the powerful spreading technique of statistical 
mechanics which, by focusing on the time dependence of a 
few quantities, permits the precise location of the param- 
eter values bordering the sub and supercritical regimes 
of population growth. In addition, our computational re- 
sources allow us to handle metapopulation sizes of order 
of 10^ packages, a figure that would be unthinkable in the 
beginning of the 1980s. 

In the case of finite processivity A, which is the situation 
of interest in package models, we find that the parasites 
have no significant role in the evolution of the metapopula- 
tion and, in particular, mutations to the parasite class do 
not set an upper bond to the values that A can assume. This 
is in disagreement with the findings of the original analy- 
sis (see Figs. 8 and 9). In fact, the phenomena responsible 
for the scenario 'life between Scylla and Charybdis' are the 
assortment load (i.e., the random assignment of templates 
to the daughter protocells) which sets a lower bound to A 
and the presence of lethal mutants which, in turn, set the 
upper bound to A (see Figs. 10 and 11). 

An important result, which seems to have been com- 



pletely overlooked by Niesert et al. (1981), regards the ap- 
parently unproblematic situation where the replication fi- 
delity is perfect, i.e., u = v = Q. The difficulty is illus- 
trated in Fig. 4 which shows that the lower the diversity, 
the higher the growth rate of the metapopulation. So, in 
the case that two such mctapopulations are set to compete 
in the same environment, it is evident that the lineage with 
higher diversity will inevitably be excluded. The situation 
becomes even worse in the presence of parasites and lethal 
mutants, since these mutants tend to cause more harm to 
the lineages with high div(Tsity (sec> Figs. 8 and 10). This is 
a very unpleasant situation, the solution of which requires 
ad hoc assumptions about the protocell fitness (e.g., a fit- 
ness that increases with the template diversity) or about 
the dependence of A on d in order to revert the negative 
scenario revealed by Fig. 4. 

A word is in order about the amount of extra informa- 
tion eked out by confining the templates in packages. Here 
we equate information content with template length. The 
fundamental quantities here are the spontaneous error rate 
per nucleotide e, the molecule length L 3> 1, and the frac- 
tion of neutral single nucleotide substitutions A (see, e.g., 
Takeuchi et al. (2005)) from which we can readily obtain the 
probability that a template copy becomes nonfunctional 
(i.e., a parasite or a lethal), 

^ = u -I- = 1 - exp [-e (1 - A) L] . (9) 

A generous estimate of these parameters (see, e.g., Drake 
and Holland (1999); Johnston et al. (2001); Kun et al. 
(2005)) is e ~ 0.005, L ~ 200 and A ~ 0.25 which yields 
ji ~ 0.53. This is a truly disastrous result which precludes 
the coexistence of even two templates. Alternatively, we can 
use Eq. (9) to calculate the maximum length Lm of a tem- 
plate in a best case scenario where lethal mutants are ab- 
sent V = Q and the replicase processivity is infinite so that 
Uc « l/d. The result is = - [In (1 - 1/d)] / [e (1 - A)], 
which, using the same parameter values as before, yields 
Lm ~ 77 for d = 4, resulting thus in a total of about 300 
nucleotides. Considering that this is a best case scenario, 
the improvement over the free-gene situation (about 200 
nucleotides) is scanty. For instance, for large d we have 
Ljnd « 1/ [e (1 — A)] which is essentially a constant value 
that depends on environmental factors only. Hence there is 
no information gain by increasing d since the length L^. of 
the coexisting templates must decrease in the same propor- 
tion so as to keep the total information content (i.e., Ld) 
constant. A similar 'information conservation principle' is 
likely to hold for the hypercycle model as well. We recall 
that in the case of the elementary hypercycle, the maxi- 
mum number of templates that can coexist in a dynamical 
equilibrium state is d = 4 (Eigen and Schuster, 1978). The 
analysis of the error propagation in the hypercycle (Cam- 
pos et al., 2000) is not useful here because the error tail 
(mutant) class considered in that work is not equivalent to 
the parasite class of the package models, since those mu- 
tants do no receive catalytic support from the hypercycle 
members. 



The original package model proposed by Niesert et al. 
(1981) and reviewed here is truly extraordinary for a sim- 
ple reason, which perhaps passed unnoticed even to their 
proponents: because the protocells do not compete and the 
lineage is free to grow unrestrainedly, the region of viability 
of the metapopulation in this model is the greatest possi- 
ble! Introducing the notion of protocell fitness or prospec- 
tive survival value that depends explicitly on the template 
composition of the protocells, regardless of the prescription 
one chooses, can only reduce the size of that region. This 
is the reason that the critical imitation probability in Fig. 
8 is greater than its counterpart in Fig. 9. This moans that 
our estimates for the critical mutation probabilities Uc and 
Vc are upper bounds to the critical mutation probability of 
any package model for which the survival of the protocell 
is linked to the coexistence of a set of functional templates. 
Therefore, since even in this best case scenario the infor- 
mation gain derived from the coexistence of the distinct 
templates is not significant, this class of models should be 
discarded as possible solutions to the prebiotic information 
crisis. This is not to say that packages are not important 
in prebiotic evolution, since it is undisputable that com- 
partments are essential to the evaluation of the translation 
products of the information coded in the templates (Eigen 
et al. , 1980). 

Coming up with a coherent scenario to explain the coex- 
istence of distinct templates has proved to be a most dif- 
ficult endeavor and it may already be time to turn to new 
approaches to solve the information crisis of prebiotic evo- 
lution. We have nothing to offer on this direction. 
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